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Oj. Abstract 

We give an arithmetic version of the recent proof of the triangle removal lemma by Fox 

Q\ [ |Foxllj . for the group F£. 

A triangle in FJ? is a tuple (x, y, z) such that x + y + z = 0. The triangle removal lemma 
for FJ? states that for every e > there is a S > 0, such that if a subset A of FJ? requires the 
removal of at least e • 2™ elements to make it triangle-free, then it must contain at least 5 ■ 2 2n 
triangles. We give a direct proof which gives an improved lower bound for 5 (as a function of 
e), analogous to the one obtained by Fox for triangle removal in graphs. 

_C The improved lower bound was already known to follow (for triangle- removal in all groups) , 

using Fox's removal lemma for directed cycles and a reduction by Krai, Serra and Vena [KSV09 
(see |Foxll[ ICF12J ). The purpose of this note is to provide a direct Fourier-analytic proof for 
the group F£ . 
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1 Introduction 

The triangle removal lemma for graphs states that for every e > 0, there exists a 5 > such that 
for every graph on n vertices with at most 5n 3 triangles, one can remove all triangles by deleting at 



most en 2 edges. Contrapositively, this means that if a graph is at most e-far from being triangle-free 
i.e., one needs to delete more than en 2 edges to make it triangle free, then it must have at least 
Sn 3 triangles. 

The lemma was originally proven by Ruzsa and Szemeredi JRS78] with a bound of 5 > 
l/Tower(poly(l/e)), where Tower(i) denotes a tower of twos of height i, i.e., Tower(i) = 2 Tower (*~ 1 ) 
for % > 1, and Tower(O) = 1. This bound was only recently improved to 5 > l/Tower(0(log(l/e))) 
in a remarkable paper of Fox [Foxllj . 

The above lemma has a direct application to property testing since for a graph which is e-far from 
being triangle-free, a random triple of vertices is guaranteed to form a triangle with probability 
at least 5. Thus if we test 0(1/5) random triples of vertices, we will find a triangle with constant 
probability. This test can then distinguish such a graph from one which is triangle-free, since, for 
a triangle-free graph, the probability of the test succeeding is 0. The triangle removal lemma and 
its generalizations to removal of more general graphs (instead of a triangle) have several interesting 
applications in mathematics, and we refer the reader to the survey [CF12] for a detailed discussion. 
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An arithmetic version of the triangle removal problem was considered by Green |Gre05] . Let G be 
an Abelian group with \G\ = N and let A C G be an arbitrary subset. We call a triple (x, y, z) G A^ 
a triangle iff x + y + z = 0. Similar to the graph case, we say that A is e-far from being triangle- 
free if one needs to remove at least eN elements from A to make it triangle-free. Green proved 
an arithmetic analogue of Szemeredi's regularity lemma [Sz e75| . used in the proof of Ruzsa and 
Szemeredi, and proved a triangle removal lemma for Abelian groups. 

Theorem 1.1 (Green [Gre05j) For all e G (0,1], there exists a 5 > l/Tower(poly(l/e)) such 
that for an Abelian group G with \G\ = N, if a subset A C G is e-far from being triangle- free, then 
A must contain at least 5N 2 triangles. 

Other than being useful for proving the above result, Green's arithmetic analogue of Szemeredi's 
regularity lemma has had many other applications in combinatorics |GT10j and computer science 
|KO09] . In addition to leading to interesting analogues of combinatorial statements, the arith- 
metic setting has at least one more advantage: the proofs of these statements often proceed by 
partitioning the underlying spaces, and because of the structure provided by subgroups and cosets, 
the arguments are often cleaner in the group setting (specially for vector spaces over finite fields). 
This makes the arithmetic setting a convenient one to study for quantitative improvements to the 
results. 

In this paper, we present an arithmetic analogue of the proof by Fox, for the group F^. The 
argument can also be extended to other Abelian groups, but we restrict ourselves to Fg for simplicity. 



Theorem 1.2 For all e G (0, 1], there exists a 5 > l/Tbwer(0(log((l/e))) such that for all n G N 

and N = 2 n , any subset ACFj which is e-far from being triangle-free, must contain at least 5N 2 
triangles. 



We remark that the above result (for all groups) already follows from a version of the removal 
lemma for directed cycles, using a reduction by Krai, Serra and Vena [KSV09] . This was already 
observed by Fox [Foxll] (see also [CF12]). However, we believe some of the Fourier analytic notions 
that come up in a direct arithmetic proof might be of independent interest. Also, a direct proof 
makes it somewhat more transparent how the argument partitions the underlying group, which 
might be useful for further improvements. We present a sketch of the proof below. 

Known lower bounds for triangle removal. The original proof by Ruzsa and Szemeredi used 
Szemeredi's regularity lemma [Sze75| . However, the known lower bounds for the regularity lemma 
and its variants (see [Gow97| and |CF12j ) imply that this approach necessarily obtains a bound 
on 1/5 which is at least Tower(poly(l/e)). Fox's argument [Foxll] manages to avoid using the 
regularity lemma directly (although his proof still follows the same outline as the proof of the 
regularity lemma), thus obtaining 1/5 < Tbwer(0(log(l/£))). 

In terms of lower bounds on 1/5, it was shown by Alon [Alo02] that one must have 1/5 > 2 ( g v-' e >> 
for triangle removal in graphs. For the problem of triangle removal in groups, Bhattacharya and 
Xie [BX10] show that one must have 1/5 > (1/e) 09 . However, both these lower bounds are quite 
far from the known upper bounds on 1/5. 

1.1 Proof Sketch 

We will give a proof by contradiction. Let G denote the group FrJ, and assume, to the contrary, 
that we have a set A<Z G that is e-far from being triangle-free, and has at most <5|G| 2 triangles. We 



work with a subset i'Ci that is the union of a maximal collection of element disjoint triangles, 
i.e. no two triples representing triangles share a common element. Since A\A' is triangle-free, and 
A is e-far from being triangle-free, \A'\ > e\G\. Define Eq = l^'l/|G|. 

In the rest of the sketch, we denote A' by A, and £o by e. The proof is based on a potential argument. 
At every step in the proof, we have a partition of G into T cosets of a subgroup H ■< G where 

T = \ g /h\ cosets, and hence an induced partition of A according to cosets of H. We measure the 

mean entropy of the partition, defined as E„ 



/ f - — ijj-i J , where f(x) = x log x for x G (0, 1] 



V 1*1 J 

and /(0) = 0. Observe that the mean entropy is always non-positive, and by convexity it is at 
least eloge. Our main lemma proves that if 5 is much smaller than e 3 • g I 2 /t 2 , then we can shatter 
A, i.e., the current partition of A can be refined according to cosets of H' ^ H, such that the 
mean entropy increases by 0(e), and \ g /h'\ < 2 T '°y e ). In essence, the size of the partition, and 
hence the bound on 5, are one exponential larger at every step. Since the mean entropy is always 
non-positive, this process must stop after Oiloge" 1 ) rounds, giving the required bound on 8. 

In order to show that we can shatter A if it has too few triangles, it is convenient to equate 
A with its indicator function A : G — > {0,1}, and the number of triangles with the sum 
J2 x+V J rZ= o A(x)A(y)A(z) . Suppose we want to count the number of triangles between two cosets 
of H, viz. H + <7i and H + g2, and a third coset of H', H' + g^ + z, where gi + g2 + 93 = 0. 
Assume A has density s on all the three cosets. As a thought experiment, if A was the constant 
function e on H ' + g% + z, then the "number of triangles" between the three cosets is given by 
e\H\\H'\ E geH \ \ An (u'+9+9i)\ . \An(H<+g+ g2 +z)\ ^ _ Jf this ig significantly sma n er than e 3 \H\\H'\, then 

a Markov argument implies that if we partition according to cosets of ff', on a constant fraction of 
the cosets, A must have density significantly smaller than e, i.e. A is shattered. A defect version of 
Jensen's inequality then implies that the mean entropy of the new partition is larger by 0(e). 

Of course, A is not necessarily a constant function. However, a very similar argument works if 
all the non-zero Fourier coefficients of the function A on H' + g% + z are much smaller than e in 
absolute value - we call such functions as superregular, in analogy with the proof from [Foxllj . The 
last part of the proof is to find a superregular decomposition of A on H + g%, i.e. approximating 
An (H + g%) by a sum of superregular functions. If A is not superregular on H + 53, we pick a large 
Fourier coefficient n £ H, partition x G H according to the value of (x, 77), and restrict ourselves to 
the part with greater density. If this part is not superregular, we repeat this procedure. Since the 
density on any part is at most 1, this process must end with a superregular part. We remove this 
set from A and repeat the procedure until most of A is covered, to find the required superregular 
decomposition. 

This completes the proof sketch of Theorem 11.21 

Analogies and differences with the proof in [Foxllj . The proof of our main theorem on 
triangle-removal in groups is analogous to the proof of triangle-removal in graphs from the work of 
Fox [Foxllj . Nevertheless, we need to give the appropriate arithmetic analogues of the definitions 
and proofs. Though the proof in this paper is self-contained, for the readers who are familiar with 
the work of Fox [Foxllj . we point out the analogies and the differences between the two proofs. 

At every step in our proof, as in [Foxllj . we have a partition of the underlying set, the group 
G in our case. However, our partition is structured, and consists of all the cosets of one fixed 
subgroup, compared to an arbitrary partition in [FoxTTj . Our definitions of the potential function 
and shattering, are similar, except that we use the densities of the cosets, instead of the edge 



densities between pairs of subsets used in [Foxllj . Our notion of regularity is quite different from 
that in [ Foxllj . and is based on Fourier coefficients, similar to the one used in regularity lemmas for 
Abelian groups [Gre05| . The superregular decomposition that we find for a given set is analogous 
to the collection of superregular tuples in a graph, constructed in |Foxll| . 

2 Preliminaries 

Fix a positive integer n. Throughout the paper, we denote G = FJ? and N = \G\ = 2 n . The 
notation H ■< G denotes that H is a subgroup of G. We use H to denote the dual group of H. 
Denote by H 1 - the set of cosets of H in G. In some cases which will be clear from the context, by 
abuse of notation, we use H 1 - to denote a set of coset representatives. 

Given a set A C G, three elements x, y, z £ A are said to form a triangle if x + y + z = 0. A is said 
to be e-far from being triangle- free, if at least eN elements need to be removed from A in order 
that it contains no triangles. 

We abuse notation and use A to denote both the set A, and also its characteristic function A : G — > 
{0, 1}. Given a subgroup H ^ G, and an element g e G, we define A 9 H : H — > {0, 1} as 

A» H (x)^A(x + g). 

Let EjEgj^f-j denote the expectation when x is drawn uniformly from H. For a function / : H — >■ K., 
we define its Fourier coefficients as follows: For r/ £ H, define 

m d ^ e if(x) Xv (x)], 

where (•, •) denotes inner product over FJj . 

Jensen's inequality states that if / is a convex function, e\, . . . ,e s are nonnegative real numbers 

such that Yli=i £ i = ■*-' then, for any real numbers x\, . . . , x s , 

£i/(>i) H \- £sf(x s ) > /(eixi H h e s x s ). (1) 

Jensen's inequality immediately implies the following simple lemma, which will be useful later. 

Lemma 2.1 (Lemma 6, |Foxll| ) Let f : M>o — > M be a convex function, ei,...,e s and 

xi,...,x s be nonnegative real numbers with ^ie[si e « = !• F° r I != \ s \> ^ c = SiG/ e «> 
u = Yli£i £ i x i/ C > an d v = Sjefs]\/ e * rE */(l — c )- r/ten we /taue 

X)ei/(«i)>c/(«) + (l-c)/(T;). 

i€[s] 

2.1 Entropy 

Our proof of the main theorem will be based on a potential argument, where our potential will be 
the mean entropy of a partition, as defined below. 

Definition 2.2 (Entropy) Define the entropy function / : [0,1] — >• R as f{x) = xlogx, for 
xe (0,1], and /(0) = 0. 



Definition 2.3 (Mean Entropy of a Partition) Let H' ■< H < G, and g G G. Given a set 
A <^G, if we partition the coset H + g as cosets of H', define the mean entropy of this partition as 
follows 

Ql&H+g \ \H'\ ) gi &H \ \H'\ 

Analogously, the mean entropy of partitioning A on G according to cosets of H <G, is defined as 

T A (G,H) d ^ E ff\ An ^ H + 9i] 



gi eG V \H\ 

Remark 2.4 Assume A C G has been partitioned according to cosets of a subgroup H ■< G. For a 
refinement to this partition according to H' <H we have 

T A (G,H')= E f( lAn \%,, + 9)l )= ETa(H + 9 ,H'). 

g£G \ \H'\ J g£G 

In order to quantify the increase in the mean entropy because of a shattering, we need the following 
defect version of Jensen's inequality for the entropy function. We include a proof for completeness. 



Lemma 2.5 (Defect inequality for entropy. [Foxllj, Lemma 7) Let £i,...,e s , and 

xi,...,x a , be nonnegative real numbers with Ylie\s] e * = -*-> an< ^ a = X^e[sl e * x *- Suppose 
j3 < 1, and L C [s] is such that x% < (3a for all i £ L. Let c = X)tgj £ i- Then, 

J2 £ if( x i) >/(«) + (1 - P + /(£))«*• 

ie[s] 

Proof: We know that c < 1 since otherwise, a = J2i<i<s £ i x i = J2iei £iXi — @ a < a -> a 
contradiction. The cases when a or c are equal to follow immediately from Jensen's inequality 
(Equation ([T|), therefore we may assume that a,c/0. 

Letting u = \ Y^iei £ i x i and v = T^ 52i& £ i x ^ we nave > 

Y, £ if( x i) ^ cfiu) + (1 - c)f(v) = /(a) + caf(u/a) + (1 - c)af(v/a) 

l<i<s 

> f(a) + caf(u/a) + ca(l — u/a) = /(a) + (/(u/a) + 1 — u/ajca 

> /(a) + (1 - /3 + /(/3))ca, 

where the first inequality is from Lemma 12.11 the first equality follows from the definition of 
the entropy function /, the second inequality follows from the fact that /(|) = /( -rzi ) = 
/(l + - — 1_~ ) > JU1}£ ; anc j ^g i as t inequality follows from the fact that u/a < j3 and that 
f{x) + 1 — x is a decreasing function on the interval [0, 1]. ■ 

The following lemma shows how the mean entropy of a partition compares to that of its refinement. 

Lemma 2.6 (Entropy - Basic inequalities) Let H' < H < G, and ACG. 

1. > T A (G, H) > T A (G, G) = f (B 



2. For every g £ G, we have Fa(H + g,H') > Fa(H + g,H), and therefore Fa(G,H') > 
?a(G,H). 

Proof: Both the parts follow from convexity of /, and Jensen's inequality (Equation (HJ). ■ 

In Lemma [4. 2\ we will show how shattering a partition can substantially increase its mean entropy, 
which will allow us to use the mean entropy as a potential function. 

3 Super-regularity 

In this section, we define a notion of regularity, and show how one can approximate any set A as a 
sum of regular parts. 

Definition 3.1 (p-Superregularity) Let H ■< G, and g € G. Given a function f : H + g — > R, 
say that f is p-superregular on H + g if, for every n G H , rj ^ 0, we have, 

where the function ffj : H — > M is defined as fff{x) = f(x + g) for all x £ H. 



f 9 H (v) <P-f 9 H (o), 



Similar notions of regularity have been used in the proof of a Szemeredi type regularity lemma for 
¥'2 [Gre05j, and are well-studied as notions of pseudorandomness for subsets of abelian groups (e.g. 
see |CG92HGow98] ). 

Given a set A, the following lemma identifies a subset of A that is superregular. 

Lemma 3.2 (Finding Superregular Parts) Let H be a subgroup ofG. Given an element g £ G, 
a desired regularity parameter p E (0,1], a density parameter d > 0, and a set A C H + g such that 
\A\ > d\H\, we can find a triple (Ai,H\,zi) such that: 

1. Hi is a subgroup of H satisfying \ h /h x \ < 2 log ( 1 +p)( 1//d ). 

2. z\ € HHH^. 

3. A\ C A, Ai C Hi + z\ + g, and |Ai| > d\H x \. 

4- The indicator function of A\ restricted to H\ + z\ + g is p-superregular on Hi + z\ + g. 

Proof: We give an iterative procedure to find Ai,Hi and z\. Initialize A\ = A, Hi = H and 
z\ = 0. Observe that zi £ H D H^, and \Ai\ >d\H\\. 

If the indicator function of Ai restricted to Hi + g + zi is p-superregular on Hi + g + zi , we are 
done and we can return (Ai ,Hi,zi). 

Otherwise, we must have an r\ £ Hi,rj 7^ such that the Fourier coefficient \Ai 9 H Zl (t])\ > p ■ 
^ig 2l (0). Now, define H 2 to be H x D {h \ (h,n} = 0}. Thus \Hi/h 2 \ = 2. 

If \Ai n (H 2 + g + zi)\ > \Ai n (H 2 + g + zi + r})\, let z 2 = z\. Otherwise, let z 2 = zi + rj. Note 
that z 2 G HHH^. 

Define A2 to be A 2 = Ai (1 (Hi + g + z 2 ). Observe that we must have IjA > (1 + p) ■ rjjt, and 
hence \A2\ > d\H 2 \ in particular. 



Now, we let Hi = H 2 , A\ = A 2 and z\ = z 2 , and repeat the whole procedure. At every step, the 
triple (Ai, Hi, zi) satisfies properties 2 and 3. 

Since the density of Ai, i.e. \jf\, is increasing by a factor of (1 + p) at every step, and can be at 
most 1, in log( 1+p )(l/d) iterations we must find a triple such that the restriction of Ai to Hi + zi+g 
is /9-super regular on Hi + zi + g. Thus, the final triple satisfies property 4. Finally, property 1 also 
holds since at every step, l^/ffil increases by a factor of 2. ■ 

Repeatedly applying the above lemma gives us the following corollary, which allows us to decompose 
a set into superregular parts. 

Corollary 3.3 (Superregular Decomposition) Let H be a subgroup of G. Given an element 
g 6 G, a desired regularity parameter p £ (0,1], a density parameter d £ (0, 1], and a set A C H+g, 
we can find a positive integer t, and a collection oft triples (Ai,Hi,Zi), i = l,...,t, such that 
{Ai}j =1 are disjoint subsets of A satisfying \A\(A% U • • • At)\ < d\H\, and for every i = 1, . . . ,t: 

1. H is a subgroup of H satisfying \ H /Hi\ < 2 log ( 1 +p) (1 / d ). 

2. Zi eHDH^. 

3. Ai C A, AiCHi + Zi + g, and \Ai\ > d\Hi\. 

4- The indicator function of Ai restricted to Hi + Zi + g is p- superregular on Hi + Z{ + g. 

Proof: We build the collection of triples one at a time. At the i th step, let B = A\(AiU- ■ ■ Aj_i). 
If \B\ < d\H\, we can return the triples found till step (i — 1). 

So we can assume \B\ > d\H\, and apply Lemma 13.21 with subgroup H, element g, regularity 
parameter p, density parameter d, and subset B C H + g, to find (Ai,Hi, Zi) satisfying the required 
properties 1-4. Moreover, since Ai C A\(A\ U . . . -Aj-i), Ai is disjoint from A\, . . . ,Ai-\. 

We add the triple (Ai,Hi,Zi) to the collection and iterate. ■ 

4 Shattering 

In this section, we prove that if a set A contains few triangles from three cosets, then we can partition 
at least one of the cosets into parts with significantly varying densities. In order to formally state 
the concerned lemma, we need to define shattering. The following definition is similar to that used 
in Fox's proof of the triangle removal lemma [Foxllj . 

Definition 4.1 (Shattering) Let H be a subgroup of G. Given g € G and a set A C G, define 

d = - — ijj-i g " . Given a subgroup H' ■< H, and parameters a,/3 € (0,1], we say that H' (a,(3,k)- 
shatters A on H + g if: 

1. \ h /h'\ < 2 k , and 

2. P g/eH l\An(H' + g + g')\<(3d\H'\}>a. 

Namely, partitioning H + g according to cosets of H' results in significant variation in the density 
of A.' 



Once we find a shattering, the defect version of Jensen's inequality allows us to prove that the 
mean entropy of the partition increases. 

Lemma 4.2 (Entropy Increment) Let H' < H ^ G, and A C G. Let g £ G. If H' (a,(3,k)- 
shatters A on H + g then 

T A (H + g,H')>T A (H + g,H) + (l-(3 + f(f3))a- lAn \ I *. + 9)l . 

Lo | 

Proof: Follows from the definitions of J- a an d shattering, and applying Lemma 12.51 ■ 

Now, we can state the main lemma of this section. 

Lemma 4.3 (Shattering Lemma) Let H -< G, and gi,g2,93 G G such that gi + g2 + 93 = 0. 

We are given a set A<ZG with densities di,d,2, and d% on H + g\, H -\-g%, and H + g% respectively. 
Then either A contains ■vd\d2d%\H\ 2 triangles, or else, there is a subgroup H' < H such that H' 
(V 20 ) 3 / 4 ; l°gi+p ( 2 M)) shatters A on at least one of H + §2 or H + g%, where p = -^ A - 

We first prove some necessary lemmas, and then give a proof of the above lemma at the end of this 
section. The following lemma allows us to count the number of triangles between three different 
sets. 

Lemma 4.4 (Triangle Counting) Let H' -< H -< G. We are given gi,g2,93 G G such that 
91+92+93 = 0, and z\ G H. For any sets A, B, C C G, the number of triangles between A n (H' + 
gi + z\),B n (H + 52), and C n (H + #3) is given by, 

\H\\H'\ Yl A^P^ a )B§(a + V )C§(a + r,)xa +v (z 1 ). 

a£H~',ri£H> J -r\H 

Proof: We first observe that for any x\ G H', and X2,x^ G H, three elements x\ + g± + z\ G 
A, X2 + 52 G B, and 23 + g% G C, form a triangle iff x\ + x 2 + X3 = Z\, since, 

(xi +g\ + z\) + (x 2 + 92) + (X3 + 93) = xi + x 2 + rc 3 + zi. 

Thus, in order to count the triangles in the three cosets, we count all such triples xi,x 2 ,X3. 



No. of triangles = E A(x\ + g\ + z 1 )B(x 2 + 52)C(x 3 + 53) 



Xl+X2+X3=Zl 



J2 A»*«(zi)B»(x 2 )C2(x3) 



XiGH' ,X2,Xs£H 
Xl+X2+X 3 = Zl 



E E ^ +2l («)5|(/3)cf(7)x a (xi)x^(x2)x 7 ( 



x 3j 
X!eH',X2,X3eH a£H } ,p,j€H 

Xl+X2+X3=Zi 

E E ^+ 7i («)^|(/3)c|( 7 ) Xq (x 1 )x / 3(x2)x 7 (xi + x 2 + Zl ) 

X!eH',x 2 eH azHifi^H 

\H\\H'\ E ^? 1 («)^|(/3)C|(7) E / [ Xa+7 (x 1 ) X/3+7 (x2)x 7 (^i)] 

— ; — _ xiG-H 

a<=H',/3,jeH x 2 eH 



\H\\H'\ Y, A%+ z ^a)B%(P)C%(P) E^uW 

a<=H',/3<=H 

\H\\H'\ ^ £ J^(a)£|G8)^G9)X/j(*i) 

|tf||tf'| J] i|? ri (a)5|(a + r ? )c|(a + r ? ) Xa+ ,(z 1 ) 

aeif',?76H'- L nH, 



Next we use the above lemma to show that if the number of triangles between three sets is small, 
then we can shatter at least one of them. 

Lemma 4.5 (Shattering with a Superregular Part) Let H' ^ H ■< G. We have 51,52,53 S 
G such that 51 + 52 + 53 = 0, and z\ € H. Suppose we are given three sets A,B,C C G, such that 

\An(H' + 9l + Zl )\ \BD(H + g 2 )\ \Cn(H + g 3 )\ 

w\ =di ' — \h\ — =d2 ' — \w\ — =d3 - 

Also assume that the indicator function of A restricted to H' + g\ + Z\ is -^-superregular on 
H' + gt + zi. 

Then, either there are at least -rdid2d3\H\\H'\ triangles between A n (H' + 51 + z\), B n (H + 52), 
and C n (H + 53), or else, H' ( 1 /20, 3 /4, log 2 \ h /h'\)- shatters either B on H + 52, or C on H + 53. 

Proof: We first use Lemma 14.41 to count the triangles between A n (if' + 51 +zi),Bn (iJ + 52), 
and CC\ (H + g 3 ). 

No. of triangles = |iT| |iJ'| ^ Ag, +2l (a)B§ (a + 5)C?f (a + 7/)x a +^(«i). 

aeH',rieH'- L nH 

= \H\\H'\ I Y, ^(0)5|(5«(5)x,^i) 

+ J2 ^(a)B§(a + n)C§(a + V ) Xa+ r l (zi) 

a£H',rieH' ± nH,a^0 

>\H\\H'\U Y B§(v)C§(ri)Xv(zi) 



did 2 d 3 



4 _ 

a£H',r]&H' ± nH,a^0 



Y B^(a + r,)C 9 H 3 (a + V ) X a +v (z 1 )), 



where the last inequality uses the fact that the indicator function of A restricted to H' + g\ + z\ is 
^-superregular on H' + g 1 + z 1 , and that A 9 ^ Zl (0) = d\. 



> 



Using the Cauchy-Schwarz inequality, we get that the second term in the bracket is at least 
— \d\d 2 d 3 \Jd 2 d 3 > —\d1d2d3. Thus, if we have fewer than jdid 2 ds\H\\H'\ triangles, we must 
have that the first term in the bracket is at most \d\ d 2 d 3 . This implies that, 

£ Sgfo)^(»7)*,(*i) < ^- 

rteH ,A -r\H 

We prove the following lemma, which allows us to deduce that H' shatters either B on H + g 2 , or 
C on H + 33. A proof has been included later in the section. 

Lemma 4.6 (Fourier shattering) Let H' < H < G. Given two functions f and g from H to 
M>o that satisfy 

for some positive d\,d 2 and z\ £ -ff; define the function f{v) = E y< =H' [f( v + y)] an d g{v) = 
E yeH> [g(v + y)] , where v G H . Then, either F v [f(v) < |dj > i, or P„ [<j(u) < \d 2 ] > ^. 

Assuming this lemma, and applying it to the functions -B]^ and Cf^, we can deduce that for the 
functions B%(v) ^ E yeH , [B(v + y + g 2 )] = \ B ^+^)\ and ^ (y) def ^^ [c{y + y + ^ = 

I cn (^fl3+«)| ; where w g HC\H' L , we have either P„ [sf («) < fcfcl > ^ or P„ [cf (u) < |d 3 
2Q. This means that iJ' (V 20 ; 3 / 4 > log |^/iT'|)-shatters either B on H + g 2 or C on H + g%. 

We can now give a proof of the main lemma. 

Proof: (of Lemma \4-3\ l- Let p = -^j- 2 -- Apply Corollary 13.31 with subgroup H, element gi, 
regularity parameter p, density parameter ^1/2, and the set A n (il + 51), to partition A n (il + 51) 
into /3-superregular parts. Let {(Ai,Hi,Zi)Y i=1 be the triples returned by Corollary 13.31 satisfying 
\ H /Hi\ < 2 log ( 1 +p) (2/dl) , Ai <Z Hi + Zi + gi, \Ai\ > di/ 2 \Hi\ , and that the indicator function of A4 
restricted to Hi + z% + g\ is /5-superregular on Hi + Zi + g\. 

Fix a triple (Ai,Hi,Zi). If i/j (V 2 o, 3 /4, log 2 |^/^|)-shatters A on H + g 2 or H + g 3 , it also 
(Y20, 3 /4, log 1+y0 2 /di)-shatters it and we are done. Otherwise, applying Lemma 14.51 to the sets 
Ai,A n (H + g 2 ), and A n (i? + 53), on the cosets Hi + Zi + <7i,-£f + 52, and ff + 53, respec- 
tively, there must be at least jjjf\d 2 ds\H\\Hi\ = jd 2 ds\Ai\\H\ triangles between Ai,An (H + g 2 ), 
and An (H + g 3 ). 

Repeating the above argument for every triple, assume we do not find a subset H' that 
( 1 /20, 3 /4, log 1+p 2 /di)-shatters A on at least one oi H + g 2 or H + g%. Then, since the sets {Ai}i 
are disjoint and for all i, Ai C A , the total number of triangles between A n (i7 + gi), A n (i7 + 52)1 
and ^4 n (i? + 53) is at least 

J2 -d 2 d 3 \Ai\\H\ = -d 2 d 3 \H\ J2 \Ai\ > -d x d 2 d 3 \H\ 2 , 

i=l ' i=l 



where the last inequality follows because {^4j}* =1 are disjoint subsets satisfying \(AC\(H + gi))\(AiU 
f |#|, and hence Y*\M ^ \ Ar] ( H +9i)\-^t\H\ > f 



■■■UA t )\ < fy\H\, and hence Y^i\ A i\ > \ A ^(H+gx)\-^\H\ > fy\H\. This completes the proof. 
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We now give a proof of Lemma 14.61 

Proof: (of Lemma \4-6\) . We can simplify the above expression as follows: 

Y /(^WXryOl) = TJjH Y Y f{ X l)Xr ) {xi)g{x 2 )Xr ) {x2)Xr,{zi) 

r)£H' x nH x u x 2 eH n£H ,J -nH 

= \H\\H'\ S f( x i)9(x2) 

xi,X2&H 

Xl+X'2+ZieH' 



(Since x\ + x 2 + z\ E H) 

tttttt Y f{ x i)9{xi + yi + zi) 



\H\\H' 

x 1 eH,y 1 eH' 



Y f(v + y2)g{v + y2 + yi + zi) 



\H\\H' 

' yi,V2&H' ,veHr\H' 

I fj'\ 

\-tl v — ■v _, 

> E 



I -Hi ^ , j/aeH' 
ueffnH'- 1 - 

l vr'\ 

\-tl v — ■v _, 

\ — i > E 

-H" ^ . weff' 
ueffnK'- 1 - 



/(« + ?/2) E [g(v + y 2 + yi + zi' 



/(w + w) e Mv + yx + zi)] 

yiGH' 



(Since for y\ uniform over H , y% + y 2 is also uniform over iJ ) 
iir| £ E \f(y2 + v)] E M Vl + V + Zl )] 






veHHH 



e [7(«)ff(« + ^; 

mm 1 



Thus, 



e , [7(,)^ + , 1 )]< aV/ ' ! 



veHnH' 1 - ' ' 2 

We now claim that either P v [f(v) < |di] > ^, or P„ [<7(u) < jd 2 ] > ^. This holds because 
otherwise, since f(v),g(v) > 0, 

(1 \ o o 01 1 

1 - 2 • - • ~d X ■ -d 2 = — did 2 > -d!d 2 , 
20/ 4 4 160 2 

which is a contradiction. ■ 



5 Proof of the main theorem 

The proof of Theorem 11.21 will follow by repeated applications of the following lemma. 

Lemma 5.1 (Main Lemma) Let A C G be a set that is a union of eN disjoint triangles. Suppose 
we have partitioned G into cosets of a subgroup H, and let T = \ g /h\. If A contains less than 
Q^rpi N 2 triangles (not necessarily disjoint), then there is a subgroup H' < H such that: 

1. T A (G,H')>T A (G,H) + ^. 
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2. \Gjw\ < c T , where c = 2 2log ^ 2 /^ (i/e) . 



Proof: First, remove those elements from A which belong to cosets of H, where A has density 
less than £ /2. Let A' be what is left from A. Notice that the number of elements removed in this 
process is at most eN/2, and hence A' contains at least eN/2 disjoint triangles. 

Let 5i,52)53) be a triangle in A', i.e., 51+52+53 = 0, and for i = 1,2,3, let d, be the density 
of A' in the coset H + g { . Note that di,d 2 ,d 3 > e/2. Since A' contains at most §^ < d ^ d s \H\ 2 
triangles, by Lemma EOl there is a subgroup H\ ■< H such that H\ ( l /20,^/4,, log 1+e 2 / 16 4 /e)-shatters 
A' on at least one of H + 51 or H + 52. 

For each coset H + g that can be shattered, identify any one subgroup that (V 20 ) 3 / 4 > l°gi+ e 2 /i6 4 A)~ 
shatters A' on H + 5. Let if' be the intersection of all these subgroups. For a coset H + g that is 
not shattered, we have Fa{H + 5, iJ') — Ta(H + 5, #) > (Lemma 12.61 P ar t 2). For every coset 
H + g such that A! is shattered on H + 5, observe that j4 n (-ff + 5) = A' n (H + 5), and hence 
Lemma 14.21 implies, 

F A {H + 5, fl 7 ) - J^(i? + g,H)= F A ,{H + 5, fl"') - ? A ,(H + 5, ff) 

^ 1 / 3 | J$\\ \A>^{H + g)\ > 1 \A'n(H + g)\ 



20 V 4 " V4// \H\ ~ 600 |iJ| 

Since A' contains at least eN/2 disjoint triangles, and at least one element from each of these 
triangles is contained in a coset that is shattered, at least eN/6 of the elements of A' belong to a 
coset that has been shattered. Thus, averaging over all the cosets of H, 

F A {G,H')-F A {G,H)>-±-- £ ~ 



Note that 
and hence, 



600 6 3600 

16 ( 4 A) 



\H/ H '\ < 2 los n- 2 /i 
\g/h'\ < T2 Tlogl +- 2 /w (4/£) < 2 2Tlog i+- 2 /i6 (4/£) = c T . 



Now we show how to deduce the main theorem using the above lemma. 

Proof: (of Theorem \1.2\ . Let 5 be such that <5 -1 is a tower of twos of height 0(log 1 /e) (the 
constant in G will be specified later). Assume for contradiction that A is e-far from being triangle- 
free, and has less than 5N 2 triangles. Thus, more than eN elements have to be removed from A to 
make it triangle-free. Consider a maximal set of disjoint triangles in A, of say £0-^ triangles, and 
let A' C A be the union of these triangles. Thus, \A'\ = 3eqN. From now on, we will only work 
with A'. Since A\A' must be triangle-free, 3eo^V > eN, i.e., £0 > £ / 3 - 

Since A has at most 5N triangles, A' also has at most 5N triangles. Now, we repeatedly apply 
Lemma 15.11 in order to find successively finer partitions of G with increasing mean entropies. For 

dcf 

i = 0, start with the trivial partition according to cosets of Hq = G for which, 

1. the number of parts is To = \ g /g\ = 1, and, 

2. the mean entropy density of the partition is Fa'{G, Hq) = 2>Eq log 3&o- 
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3 

At step i, if 5N 2 < ° 2 N 2 , then we can apply Lemma EH to refine the partition according to a sub- 
group H i+1 < Hi, such that F A ,{G,H i+1 ) > F A ,{G,Hi) + ggjg. Moreover T i+1 < 2 mi ° gl +4/ ie{4/£o) . 
Since <5 _1 is a tower of twos of height ©(log !/e) = fi(log V s o)> we can P^k the constant inside the 



dof 



large enough so that the condition 5 < -^^ is satisfied for alii < t = [12000 log V 3e ol • This implies 

J^A'iGjHt) > 3eolog3eo + gfoo ' 12000 log t^- > 0. However, we must always have FA'{G,H t ) < 
(Lemma 12.61 part 1), and hence this is a contradiction. ■ 
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